PATENT COOPERATION TREATY 

From the 



INTERNATIONAL SEARCHING AUTHORITY 



To: 

HADASSA WATERMAN 
G.E. EHRLICH (1995) LTD. 
1 1 MENACHEM BEGIN STREET 
RAMATGAN, 52521 
ISRAEL 


PCX 

WRITTEN OPINION OF THE 
(PCTRule43to.l) 


^M^^lr^ ft5? 5^FP 2009 


Applicant*s or agent's file reference 
33103 


FOR FURTHER ACTION 

Sec paragraph 2 below 


Internationa] application No. 
PCT/lL07yOOl23 


International filing date (day/month/year) 
31 January 2007 (31.01.2007) 


Priority date {dcy/monih/year) 
31 January 2006 (31.01.2006) 


International Patent Classification (IPC) or both nauonal classification and IPC 

IPC: G06F 17y28( 2006.01) 
USPC: 704/2,4,7,8 


Applicant 

RAMOT AT TEL AVIV UNIVERSITY LTD. 



1. This opinion contains indications relating to the fol (owing items: 



Basis of the opinion 
Priority 

Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 
Lack of unity of invention 

Reasoned statement under Rule 436ir. I (a)(i) with regard to novelty, inventive step or industrial 
applicabili^, citations and explanations supporting such statement 

Certain docunnents cited 

Certain defects in the international ^plication 

Certain observations on the international application 

2. FURTHER ACTION 

If a demand for international preliminary examination is made, this opinion wUl be considered to be a written opinion of the 
International Preliminary Examining Authority ("IPEA") except that this does not apply where the applicant chooses an 
Authority other than this one to be the IPEA and the chosen IPEA has notified the International Bureau under Rule t^.\bis(b) 
that written opinions of this International Searching Authority will not be so considered 





Box No. 1 


□ 


Box No. 11 
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Box No. Ill 
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Box No. IV 




Box No. V 
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Box No. VI 


□ 


BoxNo. vn 


□ 


Box No. VIII 



If this opinion is, as provided above, considered to be a written opinion of the IPEA, the applicant is invited to submit to the 
IPEA a written reply together, where appropriate, with amendments, before the expiration of 3 months from the date of mailing 
of Form PCT/ISA/220 or before the expiration of 22 months from the priority date, whichever expires later. 
For further options, see Form PCT/lSA/220. 
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Box No. I Basis of this opinion 



1. With regard to the language, this opinion has been established on the basis of: 
^ the international application in the language in which it was filed 

□ a translation of the international application into . >vhich is the language of a translation fiimished for the purposes of 

international search (Rules 12.3(a) and 23.1(b)). ^ j,^ 

2 □ ^^T^pinion has been established t^^^^ 

3. With r^^^l:'::^^^^^^ «que«^ <^*«<^-- - ^ -^^^^ -P^"^^"- 

established on the basis of: 

a. type of material 
Q a sequence listing 
ri tablc(s) related to the sequence listing 

b. format of material 
I I on paper 
I I in electronic form 

c. time of filing^fumishing 

□ contained in the international application as filed. 
n filed together with the international application in electronic form. 

□ fiimished subscquenUy to this Authority for the puiposes of search. 

4 □ inadditionjnthecasethatmorethanoneversionorcopyofasequencelistin^ 

fomished, the required statements that the information in the subsequent or additional copies is identical to that m the 
application as filed or does not go beyond the application as filed, as appropruite. were furnished. 

5. Additional comments: 



Form PCrr/ISAi^37(Box No. I) (April 2007) 





International application No. 


WRITTEN OPINION OF THE 


PCT/IL07/00123 


INTERNATIONAL SEARCHING AUTHORITY 





Box No. V Reasoned statement under Rule 43 fri!sJ(aJ(i) with regard to novelty, inventive step or industrini 
applicabili^; citations and explanations supporting snch statement 



1, Statement 

Novelty (N) Claims hSQ— 

Claims NONE 

Inventive step (IS) Claims NONE 

Claims N50 

Industrial applicability (lA) Claims 1-50 

Claims NONE ^NO 



2. Citations and explanations: 
Please See Continuation Sheet 
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V. 2. Citations and Exphinations: , , * tc< < ^01 noi^ 

1 . Claims 1-35 lack an inventive step under PCT Article 33(3) as being obvious over Nakayama ct al. (Nakayama. US 5.687,383) 

in view of Brown el al. (Brown, US 5^293.584). 

As per claims I and n, Nakayama teaches a method of constnicting translation rales using a first cfataset of semw^ 
language and a second dataset of sentences in a tai^et Imiguage. the translation rules being for translating the source language into Uie 
target language, the method comprising: ^. ^ ^. ^ ^ v j 

acquiring at least a partial source grammar characterizing the source language (Fig. 8A-his parse of source, and grammar) and 
at least a partial target grammar characterizing the target language (Fig. 8b-his parse of target and grammar); 

for each sentence of the first dataset, generating a mapping function for mapping an ordered set of source grammar symbols 
being associated with said sentence to an ordered set of target grammar symbols, thereby providing a set of mapping functions (Fig. 5 
item ST 14-his corresponding candidates); and . , - , . « i^x 

archiving said set of mapping functions in a database^ thereby constracting the U-anslation rules (C.4 Imes 9,10). 

Nakayama docs not explicitly teach translating utterances of a source language into utterances of a target language. However. 
Brown teaches translating utterances... (C.2 lines 11-24) , ... ^ . ^ 

Therefore, at the lime of the invention, it would have been obvious to one ordinarily skilled m the art to modify Nakayama 
with Brown, providing the benefit of speech input for translation. 

As per claims 2 and 1 8, Nakayama and Brown make obvious the method of claim 1 , wherein said generating said mapping 
fijnction is by a machine readable dictionary <C4 lines 5-IO-his machine readable dictionary). 

As per claims 3 and 19. Nakayama and Brown make obvious the method of claim I, further comprising archiving said at least 
partial source grammar and said at least partial source target grammar in said database (Fig. 1, item 3 his rule/grammar dictionary). 

As per claims 4 and 20, Nakayama and Brown make obvious the method of claim I, wherein said generating said mapping 
function comprises, for each dataset of the fust and second datasets, parsing each sentence of said dataset to provide an ordered set of 
symbols covering said sentence (Fig. 8A, 8B-his parse), and updating said mapping Amctions based on said ordered set of symbols (C.4 
lines 9 and 10). 



p^r rlaiim ^ and ^ 1 N^^^Byama and R f^wn iraike nhviftiig the inftthnd ftf clftim 1 further COmnrising. COnStructinP a SQWCC 
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probability matrix from the source grammar (Fig. 8A-his probability parse. Fig. 16-his matrix and score) and a target probability matrix 
from the target grammar, each probability matrix (Fig. 8b-his probability parse. Fig. 16-his matrix and score) comprising a plurality of 
entries representing co-occurrence probabilities of symbols in a respective gramraar (Fig. 8A, 8B, wherein the co-occurrence of his 
symbol with respect to grammar is generated in the parse. Fig. 16-and generated score). 

As per claims 6 and 22, Nakayama and Brown make obvious the method of claim S, further comprising, for each dataset of the 
first and second datasets, parsing each sentence of said dataset to provide an ordered set of symbols covering said sentence (Fig. 8A, 8B- 
his parse), and updating a respective probability matrix based using said ordered set of symbols (Fig 16-updated and scored by parse and 
probability). 

As per claims 7 and 23, Nakayama and Brown make obvious the method of claim 6, frirther comprising updating said set of 
mapping functions using said probability matrices (C.7 lines 5-14-his mapping updated tiased on estimation). 

As per claims 8 and 24, Nakayama and Brown make obvious &e method of claim 1 , wherein each of said source grammar and 
said target grammar independently comprises: 

terminals being associated with tokens of a lexicon characterizing said dataset and nonterminals being associated with 
equivalence classes of tokens of said lexicon and/or significant patterns of a respective dataset (Fig. 2, his terminals and nonterminals in 
eac^ respective dataset. Fig. l-his bilingual texts, C.4 lines 55-67-his original sentence and translated sentence, analyzed with respect to 
respective dictionary/lexicon). 

As per claims 9 and 25. Nakayama and Brown make obvious the method of claim 8, wherein said acquiring said source 
grammar and said target granmiar, comprises for each sentence of the first dataset and for each sentence of the second dataset. searching 
for partial overlaps between said sentence and other sentences of the respective dataset (Fig. i7-his case frame and regional 
determinatian), applying a significance test on said partial overlaps (Fig. 18-his estimation of case Hller), and deHning a most stgnificont 
partial overlap as a significant pattern of said sentence, thereby extracting significant patterns from the first and the second datasets, 
thereby acquiring nonterminals for said source grammar and said target grammar (Fig l9-his extracted significant patterns). 

As per claims 10 and 26, Nakayama and Brown make obvious the method of claim 8, wherein said acquiring said source 
grammar and said target grammar, comprises: 

for each dataset of the Hrst dataset and the second dataset: searching over said dataset for similarity sets (Fig 9 item ST21-his 
correspondence), each similarity set comprising a plurality of segments of size L (Fig. 8 A, 8B-his JI , J2, J3 components) having L-S 
common tokens and S uncommon tokens (his symbol uncommon tokens), each of said plurality of segments being a portion of a 
difTerent sentence of said dataset; and defming a plurality of equivalence classes corresponding to uncommon tokens of at least one 
similarity set (fig. 13 his ''question" and ^'problem" similarity set from different sentences), thereby acquiring nonterminals for said 
source grammar and said target grammar (Fig. 14-his acquired nonterminal for question). 

As per claims 1 1, 27, 28 and 29, Nakayama and Brown make obvious the method of claim 10, wherein said definition of said 
plurality of equivalence'classes comprises, for each segment of each similarity set: 

extracting a significant pattern corresponding to a most significant partial overlap between said segment and other segments or 
combination of segments of said similarity set, thereby providing, for each similarity set, a plurality of significant patterns (Fig. l7-his 
case information region providing overlap, and Fig. 1 8, bis missing case for the similarity set produced in the corresponding case frame); 
and 

using said pliu^lity of significant patterns for classifying tokens of said similarity set into at least one equivalence class (Fig. 
1 7-his part of speech class, see also Fig 19); 

thereby dcHning said plurality of equivalence classes (ibid, the equivalmce class identified by corresponding part of speech). 

As per claims 12, 29 and 30, Nakayama and Brown make obvious method of claim 1 1, wherein said classification of said 
tokais comprises, selecting a leading significant pattern of said similarity set (Fig. 18-his estimation of missing case and case filler 
determined by similarity set), and defming uncommon tokens of segments corresponding to said leading significant pattern as an 
equivalence class (Fig. 18-his missing case information, the estimation of the case filler directly from the equivalence class). 

As per claims 13 and 31, Nakayama and Brown make obvious the method of claim 1 1, further comprising, prior to said search 
for said similarity sets: 

extracting a plurality of significant patterns from the dataset, each sigtiiiicant pattern of said plurality of significant patterns 
corresponding to a most significant partial overlap between one sentence of the dataset and other sentences of the dataset (Fig. 2 his file 
read and extracted patterns. Fig 3 his template for translation rules including overlaps, see Fig. 6, overlapping cases) ; and 

for each significant pattern of said plurality of significant patterns, grouping at least a few tokens of saki significant pattern, 
thereby redefining the dataset (Fig. 29-his final grouped patterns to be learned, redefining the dataset). 

As per claims 14, 32 and 33, Nakayama and Brown make obvious the method of claim 13, further comprising constructing a 
graph having a plurality of paths representing die dataset (Fig. 8A, 8B), wherein each extraction of significant pattern is by searching for 
partial overlaps between paths of said graph (Fig. 9-Ltem ST2 l-his case frame correspondence overlap). 
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As per claims 15 and 34, Nakayama and Brown make obvious the method of claim 14, wherein said graph comprises a 
plurality of vertices (Fig. 8 A, 8B, his Jl, J2, J3, El, E2. E3), each representing one token of the lexicon (ibid), and further wherein each 
path of said plurality of paths comprises a sequence of venices respectively corresponding to one sentence of the dataset (Fig 8A, 8B-hi5 
sequence of sentence from Figs. 7A, 7B). 

As per claims 16 and 35, Nakayama and Brown make obvious the method of claim 14« further comprising calculating, for each 
path, a set of probability functions characterizing said partial overlaps (Fig. 18 item ST33-his probability/estimation of case filler, based 
on paths/overlap). 

2. Claims 36-50 lack an inventive step under PCT Article 33(3) as being obvious over Nakayama et al. (Nakayama^ US 5,687,383) 
in view of Brown et al. (Brown, US 5,293,584). 



Claims 36-50 lack inventive st^ under PCT Article 33(3) as being obvious over Brown et al. (Brown, US 5,293,584). 

As per claims 36 and 42, Brown teaches a method of translating an utterance from a source language to a target language, 
comprising: 

employing a structured stochastic language model so as to generate from the utterance a plurality of candidate utterances in the 
target language (Fig. 2 item 20-his candidate word generator), and so as to assign a score to each candidate utterance of said plurality of 
candidate utterances (Fig. 1 item 14, item 24); and 

selecting a candidate utterance having an optimal score, thereby translating the utterance from the source language to the target 
language (C.2 lines 32-36, 41-47-his best hypothesis match score translation). 

As per claims 37 ^d 43, Brown teaches the method of claim 36, further comprising accessing a database to obtain at least a 
partial source grammar characterizing the source janguage (C.2 lines 1-IO-his source language model) and at least a partial target 
grammar characterizing the target language (C.2 lines l-lO-his target/translation model), wherein said generation of said plurality of 
candidate utterances is based on said at least partial grammars (ibid, C.2 lines 48-65-his candidate utterances based on the models). 

As per claims 38 and 44, Brown teaches the method of claim 37, further comprising, prior to said generation of said plurality of 
candidate utterances: 

accessing said database to obtain translation rules defined by a set of mapping functions for mapping ordered sets of source 
grammar symbols to ordered sets of target grammar symbols (C.2 lines 5-7-his source language information, mapped to speech 
hypothesis): 

parsing the utterance to obtain an ordered set of source grammar symbols covering the utterance (C.2 lines 14*l8-his sequence 
of coded utterance), thereby providing an utterance cover in said at least partial source grammar (ibid); 

using said nanslanon rules for mapping said utterance cover to an ordered set of target grammar symbols (C.2 lines 32-33-hi5 
translaticm match mapping for the sequence above); and 

assigning prior probabilities in said structured stochastic language model for each target grammar symbol of said grammar 
symbols set (C.2 lines 48-54-his candidate target probabilities). 

As per claims 39 and 45, Brown teaches the method of claim 38. fitrtfaer comprising prior to said selection of said candidate 
utterance, processing said plurality of candidate utterances according to additional ranking criteria (C.2 lines 4N46-his best estimate of 
candidate utterance). 

As per claims 40 and 46. Brown teaches the method of claim 39, whecein said processing comprises, for each candidate 
utterance of said plurality of candidate utterances: 

parsing said candidate utterance to obtain an ordered set of target grammar symbols covering said candidate utterance C.2 lines 
14-1 8-his sequence of coded utterance), thereby providing a candidate utterance cover in said at least partial target granunar (C.2 lines 
4 1-46-his best estimate of candidate utterance as the cover); and 

performing correspondence analysis to provide goodness of correspondence between said candidate utterance cover in said at 
least partial target granunar and said utterance cover in said at least partial source grammar (C.2 lines 55*65-his translation match score 
for the speech hypothesis as the correspondence analysis). 

As per claims 41 and 47, Brown leaches the method of claim 38, wherein said employing said stochastic langus^ model 
con^rises replacing said prior probabilities with conditional probabilities represeming a discourse context of the utterance (C.6 lines 20- 
25-his conditional translation match, and PCT), as replacing P(T). (T*) as generated from the language match score generator). 

As per claim 48, Brown teaches a text processing system having a translator, the translator comprising the apparatus of any of 
claims 42-47 (C.2 lines 5-10-his translation). 

As per claim 49, Brown teaches a text processing system having a style checker, the style checker comprising the apparatus of 
any of claims 42-47 (CI lines 64-67, his language complexity checker). 
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